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(54) A name service for a redundant array of internet servers 

(57) A self-reconfiguring name service that distrib- 
utes workJoad among the available servers in a system. 
A Service Monitor for each host system of a site period- 
ically broadcasts information about available servers. 
The broadcast message also indicates the workload of 
the host. Each name service (DNS) has an associated 
process called a Name Binder Modifier that receives the 
broadcast messages from the Service Monitors. Period- 
ically, each Name Binder Modifier reviews the informa- 
tion it has received from various Service Monitors and 
updates zones that are used by the DNS to perform 
load balancing. The Service Monitor and the Name 
Binding Modifier have associated configuration files. 
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This application relates to distributed computing 
using the client/server model and. more specifically, to 
an improved name service for a client/server system. 

RAf pKGROUN n OF THE INVENTION 

In recent years, the internet has become extremely 
popular. Using the internet, a group of users in widely 
scattered locations can retrieve information located at a 
centralized site. The site, which may offer one or more 
"services" (also called "resources"), is connected to the is 
internet through a high speed connection. A user can 
access the site via the internet to read electronic mail, 
read news via Usenet, or view World Wide Web pages. 
As the speed of the connections between the user and 
the site improves, more users will be attempting to 20 
access the site's resources at a given time. 

Internet services in a site are accessed via the 
servers of the site. Multiple servers can provide access 
to a single service. Similarly, a single server can provide 
access to multiple services. 25 

Resources on the internet are accessed via names. 
A resource name typically contains multiple name com- 
ponents. The first name component is a domain name. 
This domain name typically refers to one or more hosts 
and conventionally is resolved through the Domain 30 
Name Service (DNS) of the site. An implicit or explicit 
second name component identifies the communications 
endpoint or server that will be used to access the 
resource on a particular host. The remaining name 
components are processed by the identified server to 35 
access the specified resource on the located host. 

For example, a WWW hypertext document is 
named by a Uniform Resource Locator (URL), such as: 

http:/ /www.sun. com/welcome. html. 
To access this document, the domain name component. *o 
www.sun.com, is first resolved to locate a host. Then, 
the hypertext transfer protocol (http) name component 
identifies a communications endpoint on the located 
host. An http server communicates with the client 
through this endpoint and retrieves the document spec- «5 
ified by the remaining component of the name, /wel- 
come.html. The File Transfer Protocol (FTP) service 
uses similar URLs. 

Usenet news articles are named in a similar way. To 
obtain a news article, the news reader client resolves a so 
domain name to locate a news server. It then communi- 
cates with the news server using a well-known endpoint 
to retrieve a particular article on the news server. 

Many internet systems include a process called a 
"name service" that examines the type of service ss 
required by each incoming request and returns a list of 
addresses of servers that can handle the request. It is 
the responsibility of the name service to balance the 



such a way that work is spread evenly betweertftne , „ 
servers. \ 

In a redundant server system, a site has more then*.. J 
one server. These servers collectively serve the entire 
user population. A problem arises when certain servers 
are performing more than their share of the workload of 
incoming user tasks. Some servers are operating at or 
near their capacity while other servers thai could be 
sharing the workload remain idle. Such a system is said 
to have an "unbalanced" workload. It is desirable to bal- 
ance the workload of the servers as much as possible. 

In addition, servers go in and out of service. 
Because any server can serve any user, the workload of 
a failed server can be assigned to other working serv- 
ers. A conventional redundant server system etlects 
load balancing using a "round robin" scheme to assign 
incoming requests to servers. As each request is 
received, the DNS service assigns a next server of the 
available servers. A round robin scheme for load bal- 
ancing is not always satisfactory because the name 
service does not always have up-to-date information 
about which servers are actually available. For example, 
a server may have failed or may have been added to the 
system without the name service being notified. Con- 
ventional name services often rely on a human being to 
reconfigure the knowledge that the name service has 
about the system. 

A site installs one or more internet servers to gain 
access to the internet and its services. The primary goal 
is to enable a site to deploy its internet servers in a 
highly "scalable" and "reliable" manner. A "sealable 
deployment" architecture allows the site to support a 
large number of users and to increase capacity grace- 
fully as the number of users increases. A "reliable" 
architecture minimizes the impact of component fail- 
ures. It may mask failures by redistributing the workload 
of failed components to working components. 

SUMMARY Q P THF INVENTION 

The present invention overcomes the problems and 
disadvantages of the prior art by implementing a self- 
reconfiguring name service that distributes workload 
among the available servers in a system. A Service 
Monitor for each host system of a site periodically 
broadcasts information about available servers. As new 
servers are added to the host, they are announced by 
way of this broadcast message. The broadcast mes- 
sage also indicates the workload of the host. 

Each name service has an associated process 
called a Name Binder Modifier that receives the broad- 
cast messages from the Service Monitors. Periodically, 
each Name Binder Modifier reviews the information it 
has received from various Service Monitors. The Name 
Binder Modifier maintains a list for each service of avail- 
able servers. For each service, servers executing on a 
host whose workload exceeds a predetermined work- 
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load value are deleted from a list of available servers. 
The list of available servers for each service contains at 
least a minimum number of servers whenever possible, 
however. 

The Name Binder Modifier periodically updates a 
plurality of zone files for respective services in accord- 
ance with its lists to indicate which servers are available 
to have work routed to them. The Name Binder Modifier 
assumes that servers which have not broadcast a mes- 
sage within an update interval are no longer available. 
The DNS service loads information from the updated 
zone files for use in its routing scheme. 

Each Service Monitor has an associated configura- 
tion file specifying, for each service, a time interval after 
which the Service Monitor should send a broadcast 
message. Furthermore, each Name Binder Modifier has 
a configuration file specifying, for each service, a time 
interval after which the zone files should be updated in 
accordance with the broadcast messages, a minimum 
number of servers for the service, and a maximum 
desirable workload per server. 

In accordance with the purpose of the invention, as 
embodied and broadly described herein, the invention is 
a method for reconfiguring a load balancing system, 
comprising the steps, performed by a data processing 
system, of sending, by a Service Monitor, for a service 
of the data processing system, a broadcast message 
containing a workload of a host on which the service is 
located and a list of addresses of the host; receiving, by 
a Name Binding Modifier, the broadcast message; and 
updating, by the Name Binding Modifier, a zone file in 
accordance with the information in the broadcast mes- 
sage. 

Objects and advantages of the invention will be set 
forth in part in the description which follows and in part 
will be obvious from the description or may be learned 
by practice of the invention. The objects and advan- 
tages of the invention will be realized and attained by 
means of the elements and combinations particularly 
pointed out in the appended claims. 

DETAILED DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

The accompanying drawings, which are incorpo- 
rated in and constitute a part of this specification, illus- 
trate several embodiments of the invention and, 
together with the description, serve to explain the princi- 
ples of the invention. 

Fig. 1 is a block diagram of a computer system in 
accordance with a preferred embodiment of the present 
invention. 

Fig. 2 shows an example of a host system having a 
plurality of servers and services. 

Fig. 3 is a flow chart of steps performed by a Serv- 
ice Monitor of Fig. l to send a broadcast message. 

Fig. 4 shows an example of a format of a broadcast 
message. 



Fig. 5 is a flow chart of steps performed by a Name 
Binding Modifier of Fig. 1 when it receives a broaccas: 
message. 

Figs. 6(a) and 6(b) are flow charts of steps per- 
5 formed by tne Name Binding Modifier when a timer for a 
service expires. 

Fig. 7 shows an example of a configuration file for a 
Service Monitor. 

Fig. 8 shows an example of a configuration file for a 
io Name Binder Modifier. 

Fig. 9 shows an example of a configuration file for a 
DNS service. 

Fig. 10 shows an example of several previous and 
next lists in the Name Binding Modifier. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

Reference will now be made in detail to the pre- 
ferred embodiments of the invention, examples of which 
20 are illustrated in the accompanying drawings. Wherever 
possible, the same reference numbers will be used 
throughout the drawings to refer to the same or like 
parts. 

Fig. 1 is a block diagram of a computer system in 
25 accordance with a preferred embodiment of the present 
invention. A client 130 communicates with server 140 
over line 106, which can be. for example, a U\N, a 
WAN, or an internet connection. Line 106 can also rep- 
resent a wireless connection, such as a cellular network 
30 connection. Server 140 includes a first computer 110 
and a second computer 120. First computer 110 and 
second computer 120 are connected together via line 
107. 

First computer 1 10 includes a CPU 102; a memory 

35 104; input/output lines 105; an input device 160, such 
as a keyboard or mouse; and a display device 1 50. such 
as a display terminal. First computer 110 further 
includes an input device 161 for reading a computer 
usable medium 162 having computer readable program 

40 code means embodied therein. Input device 161 is. for 
example, a disk drive. 

Memory 104 of first computer 110 includes two 
server processes (also called "daemons"): Domain 
Name Server (DNS) server 170 and Name Binding 

45 Modifier 176. Name Binder Modifier 176 runs on every 
system that has a DNS server. Name Binder Modifier 
server 176 has an associated configuration file 1 78 and 
one or more associated header files 172 (one per zone 
file). DNS server 170 has a plurality of zone files 173. 

so 174. Each zone file corresponds to a service available 
on the system. 

A preferred embodiment of the present invention 
includes a second DNS server (not shown) to provide 
redundancy. A person of ordinary skill in the art will 

55 understand that memory 104 also contains additional 
information, such as application programs, operating 
systems, data, etc., which are not shown in the figure for 
the sake of clarity. 
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Modifier 175 searches current list 1004 of the service 
ID. If the address is not found in the current list in step 
510, then a new entry is added to the current list in step 
512. The new entry contains the current address and 
the current workload (from the broadcast message). If 
the address in the message is found in the current list in 
step 510. then the workload value from the broadcast 
message is used to update the found entry in step 514. 
Thus, the current list for the service !D contains the cur- 
rent workloads and addresses of hosts providing availa- 
ble services. Note that broadcast messages can be 
received from multiple Service Monitors. 

Figs. 6(a) and 6(b) are flow charts of steps per- 
formed by Name Binding Modifier 176 of Fig. 1. Initially. 
Name Binding Modifier 176 initializes respective system 
timers for each service (not shown). When one of the 
timers times out in step 602. Name Binding Modifier 1 76 
locates the current and previous lists for the service in 
step 604. In step 606. the current list is sorted by load. 
In step 608. Name Binding Modifier 176 counts the 
number of entries in the current list (N bh ) that contain 
workload values below a desired load index. The 
desired load index is determined in accordance with the 
configuration file of Fig. 8. In step 610, if N b i; is greater 
than the minimum number of entries in the current list, 
then there are more than enough entries in the list that 
contain load values less than the load index and. in step 
612. the current list is pruned of entries having a work- 
load value above the workload index (i.e.. so that N^ 
entries remain). If not. in step 614. the current list is 
pruned so that the minimum number of entries remain. 
After steps 612 and 614, control passes to Fig. 6(b). 

In step 620 of Fig. 6(b), the host addresses of the 
current and previous lists for the service are compared. 
If the two lists are not the same, then it is necessary to 
modify the zone files in step 624. Step 624 creates a 
temporary output file. Header file 172 is copied to the 
output file. Next, the address records for the addresses 
in the current list are written to the output file. Lastly. 
Name Binder Modifier 176 signals ONS server 170 to 
reload a zone file for the service from the output file.* 
DNS server 1 70 will now have up-to-date information on 
the state of the system. In step 626. the current list for 
the service ID becomes the previous list and the current 
list is made empty. 

If. for example, there are four available news serv- 
ers (one host address each) and the maximum load for 
each service is 60%, the minimum number of servers 
for each service is two. and the services are advertising 
the following loads: 

10%: newsl 
65%: news2 
70%: news3 
80%: news4 

Only entries for news3 and news4 are removed from the 
current list. Even though the host of news2 has a work- 



load value greater than 60%. removing news2 from the 
list would leave less man two entries in the lis;, so 
news2 is not removed. For a given service, proper oper- 
ation requires that the update interval used by Service 

5 Monitor 180 be of much shorter duration (half or less) 
than the update interval used by Name Binder Modifier 
176. Otherwise, it would be possible that not all availa- 
ble services would send a broadcast message during 
each update interval of Name Binder Modifier 175. 

\o Fig. 7 shows an example of a configuration file 1 82 
for Service Monitor 180. The configuration file includes 
three types of statements: "service'* statements, "sam- 
ple" load statements, and "exclude" statements. A 
'service" statement 702, which specifies the service 

i5 available at a site, includes the term "service" followed 
by three arguments. A first argument (e.g.. "60") is the 
update interval for the service (also called the "refresh 
interval") or the "heartbeat interval.") in a time unit such 
as seconds. The update interval is specified in time 

20 units, such as seconds. For example, in Fig. 7, the dura- 
tion of the http timer is 60 seconds and the duration of 
the news timer is 120 seconds. 

A second argument (e.g.. "http") is the name of a 
service. A third argument (e.g.. "80") is a communica- 

25 tions endpoint used to verify the availability of the serv- 
ice on a host. In the described embodiment, this 
communication endpoint is a TCP port number. 

When a service is initialized. Service Monitor 180 
obtains an update interval for the service from the con- 

30 figuration file. A broadcast message is sent by the serv- 
ice at the expiration of each update interval if it is 
determined that the service is available on the host. At 
initialization, the service also obtains the communica- 
tions endpoint for the service in accordance with the 

35 configuration file. 

Thus, the example configuration file of Fig. 7 corre- 
sponds to the servers of Fig. 2. In the configuration file, 
the update interval for the service http is 60 time units 
(e.g.. 60 seconds) and the update interval for the serv- 

40 ice news is 120 time units (e.g., 120 seconds). The http 
service uses port 80 and the news service uses port 
119. 

A "sample" load statement 704 specifies how fre- 
quently the load on a host should be measured. In this 

45 example, the sampling interval is 120 seconds. In the 
current implementation, only CPU utilization is meas- 
ured. Various implementations may use any various 
known methods of measuring load, in one implementa- 
tion, the configuration file contains the name of a routine 

50 to be used to measure load. 

An "exclude" statement eliminates certain host 
addresses and subnets from being advertised. This 
exclusion reduces the number of broadcast messages if 
certain addresses should not participate in providing 

55 services. 

Other implementations may provide support for 
UDP based services and more sophisticated per serv- 
ice liveliness test and load measurements. Service 
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20 
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• A method for reconfiguring a , oad ba , ancj 
tern, comprising the steps, performed by a da a 
processing system, of: y ° a ' a 

sending, by a Service Monitor, for a service of 
he data processing system, a broadcast mes 
sage containing a workioad of a host on which 
the serv,ce,s located and a,, s, of addresses of 

receiving by a Name Binding Modify tne 
broadcast message; and 
updating, by the Name Binding Modifier, a zone 

wZ7 S f 0nd ' n9 '° ' he S6rvice in acc °*-ce 
w,th the information in the broadcast message. 

The method of claim 1, further including the step of 
perfc.rm.ng a load ba.andng operation n accord 
ance w.th the updated zone file 

SraTon^, 0 ' ^ ,he ' 0ad °^cing 

operation ,s a round robin load balancing operation 

toad ST* °' C ' aim 3 ' Wh6rein the ™"d "bin 
load balancing operation is performed in ,rZ* 

ance with a value in the zone ,5. SngY,£S 

load balancng operation. W 

5. Je method of claim 1 . further including a plurality 
STs^^^ 

6. The method of claim 1. 

further including the step of setting, by the 
Serv,ce Mon.tor. a timer for the service and 

when 2 i^" SendinQ Step is pe *™ed 
when the timer times out. 

7. method of claim 6. wherein the step of settinc 
the timer 1S performed in accordance with a coTn 
uration file for the Service Monitor ^ 

6. The method of claim 1, 

nZI'vT" 9 ' he S,ep of by the 

Name B.ndmg Modifier, a timer for the service 

wherein the updating step is performed 
when the timer times out. earned 

1 ( ? o e , me,h0d of claim 8- therein the step of sertino 
'he timer ,s performed in accordance with a C0 Tn 
-anon file for the Name Binder Mocftfier 9 " 
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determining if the service is available; and 
if the service is available, adding to the broad- 
cast message the addresses that can provide 
the service. 

5 

11. The method of claim 1. wherein the updating step 
includes the steps of: 

finding a current list for the service; 
determining whether each address in the list of \c 
addresses is in the current list; and 
adding the address to the current list in accord- 
ance with the determining step. 

12. The method of claim 1 1 , 15 

further including the step of comparing the cur- 
rent list to a previous list. and. 

wherein the updating step is performed 
when the current and previous lists are not the 20 
same. 

13. An apparatus in a reconfiguring a load balancing 
system, comprising: 

25 

a Service Monitor configured to send, for a 
service in a data processing system, a broad- 
cast message containing a workload of a host 
on which the service is located and a list of 
addresses of the host: 30 
a Name Binding Modifier configured to receive 
the broadcast message; and 
an updating portion configured to update a 
zone file corresponding to the service in 
accordance with the information in the broad- 35 
cast message. 

14. The apparatus of claim 13, further including a 
Domain Name Service configured to perform a load 
balancing operation in accordance with the updated <o 
zone file. 

15. The apparatus of claim 14, wherein the Domain 
Name Service performs a round robin load balanc- 
ing operation. 45 

16. The apparatus of claim 15. wherein the Domain 
Name Service performs the round robin load bal- 
ancing operation in accordance with a value in the 
zone file indicating a type of load balancing opera- so 
tion. 

17. The apparatus of claim 13. further including a plu- 
rality of zone files, each corresponding to one of a 
plurality of services. 55 

18. The apparatus of claim 13. wherein the Service 
Monitor includes: 



a timer setting portion configured to set a timer 
for the service: and 

a portion configured to send the broadcast 
message when the timer times out. 

19. The apparatus of claim 18. wherein the timer set- 
ting portion sets the timer in accordance with a con- 
figuration file for the Service Monitor. 

20. The apparatus of claim 13. wherein the Name Bind- 
ing Modifier includes: 

a timer setting portion configured to set a timer 
for the service; and 

a portion configured to update the zone file 
when the timer times out. 

21. The apparatus of claim 20. wherein the timer set- 
ting portion sets the timer is in accordance with a 
configuration file for the Name Binding Modifier. 

22. The apparatus of claim 14, wherein the Service 
Monitor includes: 

a portion configured to determine whether the 
service is available: and 
a portion configured to add to the broadcast 
message, if the service is available, the 
addresses that can provide the service. 

23. The apparatus of daim 13. wherein the Name Bind- 
ing Modifier includes: 

a portion configured to find a current list for the 
service: 

a portion configured to determine whether 
each address in the list of addresses is in the 
current list; and 

a portion configured to add the address to the 
current list in accordance with the determining 
portion. 

24. The method of claim 23. further including a portion 
configured to compare the current list to a previous 
list, and 

wherein the updating portion includes a por- 
tion that updates the zone file when the current and 
previous lists are not the same. 

25. A computer program product, comprising: 

a computer usable medium having computer 
readable code embodied therein for reconfigur- 
ing a load balancing system, the computer pro- 
gram product comprising: 
computer readable program code devices con- 
figured to cause a computer to effect sending, 
for a service of the data processing system, a 
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broadcast message containing a workload of a 

host on which the serv,ce is located and a list of < j 

addresses of the host: 

computer readable program code devices con- 
figured to cause a computer to effect receiving s 
the broadcast message; and 
computer readable program code devices con- 
figured to cause a computer to effect updating 
a zone file corresponding to the service in 
accordance with the information in the broad- ,o 
cast message. 

26. A method for reconfiguring a load balancing sys- 
tem, comprising the steps, performed by a Service 
Monitor in a data processing system, of: 

setting a timer for the service of the data 
processing system, in accordance with a con- 
figuration file for the Service Monitor; and 
sending, for the serv.ce, a broadcast message 20 
containing a workload of a host on which the 
service is located and a list of addresses of the 
host, the broadcast message being usable to 
update a record of a current status of the data 
processing system. 

27. The method of claim 26, wherein the sending step 
includes the steps of: 

determining if the service is available* and 
if the service is available, adding to the broad- 
cast message the addresses that can provide 
the service. 

28. A method for reconfiguring a load balancing sys- 
tem, comprising the steps, performed by a Name 
Binder Modifier of a data processing system, of: 

receiving a broadcast message containing a 
workload of a host on which a service is <o 
located and a list of addresses of the host- 
setting a timer for the service; and 
updating a zone file corresponding to the serv- 
ice. when the timer times out. in accordance 
with the information in the broadcast message. 
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29. The method of claim 28, further including the step 
of performing a load balancing operation in accord- 
ance with the updated zone file. 

30. The method of claim 29, wherein the step of setting 
the timer is performed in accordance with a confiq- 
uration file for the Name Binder Modifier 



45 



50 



55 



8 



> 



EP 0 81 7 020 A2 



100 
\ 



130 




/ 



140 



110 



-r 



106 



120 



■i- 




9 



EP0 817 020 A2 




8NS0OC!0 <£P 08i70?OA2> 



10 



EPO 817 020 A2 



302 



Tim©-out for 
Service 10? 



304 



306 



1 


Y 


Determine whether 
service is available 







308 




Fig. 3 

Send Broadcast 310 ^ 

Message 
(Service Monitor) 



Send Broadcast message 
containing: 

service ID, 

load, 

list of addresses 



Reset Timer 



11 



EPO 817 020 A2 



Service ID 



Host Workload 



List of Host Addresses 
(Excluding those in Exclude Statement) 



Fig. 4 



12 



EP 0 817 020 A2 



512 



504 




Find current list for 
service ID 



506 



Loop for each 
address in 
message 



508 



Look up address in 
current list 



510 



Add new entry to 
list for address 




Fig. 5 
Receive 
Broadcast 
Message 
(Name Binder 
Modifier) 



Update load for list 
entry 



Loop limit 
(start or end) 



r 



516 



13 



EP 0 81 7 020 A2 




Locate current and 
previous lists for 
the service 



602 



604 



Fig. 6(a) 
At End of Every 
Timeout Interval 
(Name Binder 
Modifier) 



Sort current list by 
load 



612 



Find number of 
entries in current 
list below desired 
load index (N M ) 



606 



608 



Prune current list 
so that minimum 
number of entries 
remain 




614 



14 



EP 0 81 7 020 A2 



620 



622 



624 



626 



Fig. 6(b) 




Compare address in 
each entry of current 
and previous lists 




Update Zone File: 
-Create output file; 
-Open output file; 
-Copy header file to 

output file 
-Write address 
records for 
addresses in 
current list; 
-Signal DNS service 
to reload zone file 
from output file 



Current list becomes 

previous list; 
Make current list 

empty 



15 



EP 0 81 7 020 A2 



182 




service 60 http 80 
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dns-update 120 http www. isg.sun.com. header www.isg.sun. com. cache 1 60 
dns-update 240 news news. isg.sun.com. header news. isg.sun.com. cache 2 60 
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